SOLUTION OF THE CONGRUENCE PROBLEM FOR
ARBITRARY HERMITIAN AND SKEW-HERMITIAN MATRICES
OVER POLYNOMIAL RINGS

DRAGOMIR Ž. ĐOKOVIĆ AND FERNANDO SZECHTMAN

Abstract. Let $*$ be the involutorial automorphism of the complex polynomial
algebra $\mathbb{C}[t]$ which sends $t$ to $-t$. Answering a question raised by
V.G. Kac, we show that every hermitian or skew-hermitian matrix over this algebra
is congruent to the direct sum of $1 \times 1$ matrices and $2 \times 2$ matrices
with zero diagonal. Moreover we show that if two $n \times n$ hermitian or
skew-hermitian matrices have the same invariant factors, then they are congruent.
The complex field can be replaced by any algebraically closed field of
characteristic $\neq 2$.

1. Introduction 

Let $R$ be the polynomial algebra $F[t]$ in one variable $t$ over an algebraically
closed field $F$ of characteristic $\neq 2$. Let $*$ denote the involution of $R$
which is the identity on $F$ and sends $t$ to $-t$. (We remark that all nontrivial
$F$-involutions of $F[t]$ are conjugate in $\mathrm{Aut}_F(F[t])$.) It induces the
$\mathbb{Z}_2$-gradation $R = R_0 \oplus R_1$ of $R$ with $R_0 = F[t^2]$ and
$R_1 = tR_0$. We shall refer to the elements of $R_0$ (resp. $R_1$) as
even (resp. odd).
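As an informal illustration (not part of the original argument), the involution and the gradation can be modelled by storing a polynomial as its list of coefficients. The helper names `star` and `even_odd_parts` are hypothetical and chosen only for this sketch; integer coefficients stand in for elements of $F$.

```python
def star(p):
    """Apply the involution t -> -t to a polynomial stored as a
    coefficient list [c0, c1, c2, ...], where ck multiplies t^k."""
    return [(-c if k % 2 else c) for k, c in enumerate(p)]


def even_odd_parts(p):
    """Split p into its components in R_0 = F[t^2] and R_1 = t*R_0."""
    even = [c if k % 2 == 0 else 0 for k, c in enumerate(p)]
    odd = [c if k % 2 == 1 else 0 for k, c in enumerate(p)]
    return even, odd


p = [1, 2, 3, 4]                      # 1 + 2t + 3t^2 + 4t^3
even, odd = even_odd_parts(p)

print(star(p))                        # coefficients of p*
print(star(even) == even)             # even part is fixed by *
print(star(odd) == [-c for c in odd]) # odd part is negated by *
```

The even part is fixed by `star` and the odd part changes sign, which is exactly the statement that $R_0$ and $R_1$ are the $+1$ and $-1$ eigenspaces of the involution.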

Let $M_n(R)$ denote the algebra of $n$ by $n$ matrices over $R$. If
$A = (a_{ij}) \in M_n(R)$, we define $A^*$ to be the matrix
$B = (b_{ij}) \in M_n(R)$ where $b_{ij} = a_{ji}^*$. Thus $*$ is now made into an
involution of $M_n(R)$. We say that $A \in M_n(R)$ is hermitian (resp.
skew-hermitian) if $A^* = A$ (resp. $A^* = -A$). Two hermitian (resp.
skew-hermitian) matrices $A, B \in M_n(R)$ are said to be congruent if
$B = S^*AS$ for some $S \in GL_n(R)$.

Not long ago V.G. Kac [?] posed the following question to the first author (see
also [?]):

If $F$ is the complex field, is it true that every hermitian or skew-hermitian
matrix $A \in M_n(R)$ is congruent to the direct sum of $1 \times 1$ matrices and
$2 \times 2$ matrices with zero diagonal?



Note that no condition is imposed on the determinant of A. (The usual restriction 
is that A be unimodular.) The two cases, hermitian and skew-hermitian, of this 
problem are tightly linked because if A is hermitian then tA is skew-hermitian, and 
vice versa. 

The main objective of our paper is to give an affirmative answer to Kac's question
(Theorem 4.3), which we find quite surprising. The case $n = 2$ is dealt with in
Section 3 and the general case in Section 4. In Section 4, we also prove that two
hermitian (or skew-hermitian) matrices $A, B \in M_n(R)$ are congruent if and only
if they have the same invariant factors (Theorem 4.5). Then, in Section 5, we are
able to characterize the sequence of invariant factors of a hermitian or
skew-hermitian matrix, and to give the canonical form under congruence for such a
matrix. In the last section we make comments on other fields and characterize
those for which Kac's question has a positive answer.

The authors would like to thank Prof. Kac for his interest in our work and 
for proposing this interesting problem. We also thank Prof. L. Vaserstein for his 
comments on an earlier version of this paper. 

2. Preliminaries 

The elements $a \in R$ are polynomials and so they can be evaluated at any point
$\lambda \in F$. We denote by $a(\lambda)$ the value of $a$ at $\lambda$. We say
that a nonzero element $a \in R$ is pure if $\gcd(a, a^*) = 1$. If $a, b \in R$
with $a$ pure and $b$ even (resp. odd), then there exists $x \in R$ such that
$ax + a^*x^* = b$ (resp. $ax - a^*x^* = b$). (Choose $y, z \in R$ such that
$ay + a^*z = b/2$ and set $x = y + z^*$ (resp. $x = y - z^*$).)
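The recipe in parentheses can be checked on a small instance. The following sketch is only an illustration under assumed data: it takes the pure element $a = 1 + t$ over the rationals (standing in for $F$), uses hand-picked $y, z$ with $ay + a^*z = b/2$, and verifies both the even case $b = 1$ and the odd case $b = t$. All helper names are hypothetical.

```python
from fractions import Fraction as Fr


def trim(p):
    # Drop trailing zero coefficients, keeping at least one entry.
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p


def add(p, q):
    n = max(len(p), len(q))
    p = list(p) + [Fr(0)] * (n - len(p))
    q = list(q) + [Fr(0)] * (n - len(q))
    return trim([u + v for u, v in zip(p, q)])


def mul(p, q):
    out = [Fr(0)] * (len(p) + len(q) - 1)
    for i, u in enumerate(p):
        for j, v in enumerate(q):
            out[i + j] += u * v
    return trim(out)


def neg(p):
    return [-c for c in p]


def star(p):
    # The involution t -> -t on coefficient lists.
    return [(-c if k % 2 else c) for k, c in enumerate(p)]


a = [Fr(1), Fr(1)]                       # a = 1 + t, so a* = 1 - t and gcd(a, a*) = 1

# Even case, b = 1: y = z = 1/4 gives a*y + a^* * z = 1/2.
y, z = [Fr(1, 4)], [Fr(1, 4)]
x_even = add(y, star(z))                 # x = y + z*
lhs_even = add(mul(a, x_even), mul(star(a), star(x_even)))

# Odd case, b = t: y = 1/4, z = -1/4 gives a*y + a^* * z = t/2.
y, z = [Fr(1, 4)], [Fr(-1, 4)]
x_odd = add(y, neg(star(z)))             # x = y - z*
lhs_odd = add(mul(a, x_odd), neg(mul(star(a), star(x_odd))))

print(lhs_even == [1], lhs_odd == [0, 1])
```

In general $y$ and $z$ would come from the extended Euclidean algorithm applied to $a$ and $a^*$; they are supplied by hand here to keep the sketch short.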

If $I = Ra$ is a homogeneous (i.e., $*$-invariant) ideal of $R$, then its
generator $a$ is also homogeneous, i.e., it is either even or odd. If
$A = (a_{ij}) \in M_n(R)$ is hermitian or skew-hermitian, then the ideal generated
by all entries is $*$-invariant and we denote its generator by $\gcd(A)$. Hence
$\gcd(A)$ is the first invariant factor of $A$.

Let us fix a hermitian or skew-hermitian matrix $A \in M_n(R)$. Let $R^n$ denote
the free $R$-module of rank $n$ consisting of column vectors. We shall denote by
$e_1, \ldots, e_n$ the standard basis vectors of $R^n$. The matrix $A$ defines a
hermitian or skew-hermitian form $f_A : R^n \times R^n \to R$ by
\[ f_A(v, w) = v^* A w. \]

By [?, Lemma 1], $A$ is congruent to the direct sum of a zero matrix and a
hermitian or skew-hermitian matrix with nonzero determinant. (The proof given
there in the hermitian case is also valid in the skew-hermitian case.) This
argument shows that it suffices to consider only the hermitian or skew-hermitian
matrices with nonzero determinant.

As $F$ is algebraically closed, if $n \geq 2$ there exist nonzero isotropic
vectors, i.e., nonzero vectors $v \in R^n$ such that $f_A(v, v) = 0$.

Assume that $\det(A) \neq 0$ and set $d = \gcd(A)$. Then $A = dB$ for some matrix
$B \in M_n(R)$ such that $B^* = \pm B$ and $\gcd(B) = 1$. Therefore, without any
loss of generality, we may assume that $\det(A) \neq 0$ and $\gcd(A) = 1$.

3. The case n = 2 

In this section we show that the answer to Kac's question is affirmative if n = 2. 
We start with the hermitian case. 

Proposition 3.1. If $A = A^* \in M_2(R)$, $\det(A) \neq 0$, and $\gcd(A) = 1$,
then $A$ is congruent to $\mathrm{diag}(1, \det(A))$.

Proof. Since there exist nonzero isotropic vectors, we may assume that
\[ A = \begin{pmatrix} 0 & a \\ a^* & b \end{pmatrix}. \]
The element $a_0 = \gcd(a, a^*)$ is homogeneous, i.e., $a_0^* = \pm a_0$. We have
a factorization $a = a_0 a_1$ where $a_1$ is pure. By the hypothesis,
$\gcd(a_0, b) = 1$. Consequently, there exist homogeneous elements $x$ and $y$,
with $y$ even, such that $a_0 x + by = 1$. Clearly we may assume that
$y(0) \neq 0$. Choose a factorization $y = zz^*$ such that $a_1 z$ is pure. Then
there exists $w \in R$ such that
\[ a_1 z w + a_1^* z^* w^* = 1. \]
Since $a_0 x$ is even, we find that
\begin{align*}
1 &= a_0 x + by \\
  &= a_0 x (a_1 z w + a_1^* z^* w^*) + b z z^* \\
  &= a x w z + a^* x^* w^* z^* + b z z^* \\
  &= f_A(x^* w^* e_1 + z e_2,\; x^* w^* e_1 + z e_2).
\end{align*}
The assertion of the proposition is now obvious. □

Next we consider the skew-hermitian case. We shall need the following simple 
lemma. 

Lemma 3.2. Let $a, b \in R$ satisfy $\gcd(a, b) = \gcd(b, b^*) = 1$. Then there
exist $x, y \in R$, with $x$ even, such that $ax + by = 1$.

Proof. Choose $u, v \in R$ such that $au + bv = 1$ and $z \in R$ such that
$bz - b^*z^* = u^* - u$. Then $x = u + bz \in R_0$ and $y = v - az$ satisfy
$ax + by = 1$. □

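Proposition 3.3. Let $A \in M_2(R)$, $A^* = -A$, $\det(A) \neq 0$, and
$\gcd(A) = 1$. Then $A$ is congruent to a matrix with zero diagonal.

Proof. We may assume that
\[ A = \begin{pmatrix} 0 & a \\ -a^* & b \end{pmatrix}. \]
As $a(0) \neq 0$, we can write $a = a_1 c c^*$ with $a_1 c$ pure. By Lemma 3.2,
there exist $x, d \in R$, with $x$ even, such that
\[ bx + cd = 1. \]
By replacing $(x, d)$ with $(x + \lambda c c^*, d - \lambda b c^*)$, where
$\lambda \in F$ is suitably chosen, we may assume that $\gcd(a_1, x) = 1$. Since
$b$ is odd, we can choose $w \in R$ such that $c^* w^* - c w = b$. Since
$\gcd(a_1^*, cx) = 1$, there exist $v, p \in R$ such that
\[ a_1^* v - c x p^* = x w^* - d^*. \]
Choose $q \in R$ such that
\[ a_1^* q - a_1 q^* = p - p^* \]
and set
\[ y = v + c x q, \qquad z = w + c^*(p + a_1 q^*). \]
Then
\begin{align*}
c^* z^* - c z &= c^* w^* + c c^*(p^* + a_1^* q) - c w - c c^*(p + a_1 q^*) \\
              &= c^* w^* - c w = b, \\
x z^* - a_1^* y &= x w^* + c x (p^* + a_1^* q) - a_1^* v - a_1^* c x q \\
              &= c x p^* + x w^* - a_1^* v = d^*.
\end{align*}
Hence
\begin{align*}
a_1^* c^* y &= c^* x z^* - c^* d^* = c^* x z^* - (1 + bx) \\
            &= x (c^* z^* - b) - 1 = c x z - 1,
\end{align*}
and so
\[ S = \begin{pmatrix} c x & y \\ a_1^* c^* & z \end{pmatrix} \in SL_2(R). \]
We have
\[ S^* \begin{pmatrix} 0 & a_1 c^2 \\ -a_1^* (c^*)^2 & 0 \end{pmatrix} S
   = \begin{pmatrix} 0 & r \\ -r^* & s \end{pmatrix}, \]
where
\begin{align*}
r &= a_1 c c^* (c x z - a_1^* c^* y) = a_1 c c^* = a, \\
s &= a_1 c^2 y^* z - a_1^* (c^*)^2 y z^* \\
  &= a_1 c y^* \cdot c z - a_1^* c^* y \cdot c^* z^* \\
  &= (c^* x z^* - 1) c z - (c x z - 1) c^* z^* \\
  &= c^* z^* - c z = b.
\end{align*}
Hence
\[ \begin{pmatrix} 0 & r \\ -r^* & s \end{pmatrix} = A. \]
□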



4. Equivalence implies congruence 



The first main theorem is a simple consequence of the following two propositions.
The first one deals with hermitian matrices.

Proposition 4.1. If $A = (a_{ij}) \in M_n(R)$, $A^* = A$ and $\gcd(A) = 1$, then
there exists $v \in R^n$ such that $f_A(v, v) = 1$.

Proof. We may assume that $\det(A) \neq 0$. The proof will be by induction on
$n$. The case $n = 1$ is obvious. For the case $n = 2$ see Proposition 3.1. Thus
let $n > 2$.

Since there exist nonzero isotropic vectors, we may assume that $a_{11} = 0$.
Moreover, we may assume that $a_{1j} = 0$ for $j < n$. Denote by $E_{ij}$ the
matrix of order $n$ whose $(i, j)$-th entry is 1 and all other entries are 0, and
by $I_n$ the identity matrix. For any $\lambda \in F$, the matrix
\[ A_\lambda = (I_n + \lambda E_{21}) A (I_n + \lambda E_{12}) \]
is congruent to $A$. Let $A'_\lambda$ denote the submatrix of $A_\lambda$
obtained by deleting the first row and column. Set $A' = A'_0$. Note that for
$\lambda, \mu \in F$ we have
\[ A'_\lambda - A'_\mu = (\lambda - \mu)(a_{1n} E_{1,n-1} + a_{n1} E_{n-1,1}), \]
where now the $E_{ij}$'s have order $n - 1$.

As $\det(A) \neq 0$, we have $a_{rs} \neq 0$ for some
$r, s \in \{2, 3, \ldots, n-1\}$. Since $a_{rs}$ has only finitely many monic
divisors, there exist $\lambda, \mu \in F$, with $\lambda \neq \mu$, such that
$\gcd(A'_\lambda) = \gcd(A'_\mu)$. Denote this common gcd by $d$. The displayed
formula for $A'_\lambda - A'_\mu$ shows that $d$ divides $a_{1n}$ (and $a_{n1}$).
It also divides all entries of $A'$. Since $a_{1j} = 0$ for $j < n$, it follows
that $d$ divides all entries of $A$. As $\gcd(A) = 1$, we conclude that $d = 1$.

We have shown that $\gcd(A'_\lambda) = 1$ for some $\lambda \in F$. By the
induction hypothesis there exists $w \in R^{n-1}$ such that
$f_{A'_\lambda}(w, w) = 1$. As $A$ and $A_\lambda$ are congruent, there exists
$v \in R^n$ such that $f_A(v, v) = 1$.

□

The second proposition is a skew-hermitian analog of the first one. We shall need
the following definition. Let $\nu_A$ denote the minimum degree of nonzero
polynomials $f_A(v, w)$ over all $v, w \in R^n$ with $f_A(v, v) = 0$.

Proposition 4.2. If $A = (a_{ij}) \in M_n(R)$, $A^* = -A$ and $\gcd(A) = 1$, then
$A$ is congruent to the direct sum $B \oplus D$, where
\[ B = \begin{pmatrix} 0 & f \\ -f^* & 0 \end{pmatrix} \tag{4.1} \]
with $f$ pure of degree $\nu_A$. Furthermore $ff^*$ divides all entries of $D$,
i.e., $\det(B)$ is the second invariant factor of $A$.



Proof. For the case $n = 2$ see Proposition 3.3. Thus let $n > 2$.

After a suitable change of basis, we may assume that $f_A(e_1, e_1) = 0$ and that
there exists $w \in R^n$ such that $f_A(e_1, w)$ is nonzero and has degree
$\nu_A$. Thus $a_{11} = 0$. By performing some additional elementary congruence
transformations, we may also assume that $a_{12} \neq 0$ has degree $\nu_A$ and
that $a_{1j} = 0$ for $j > 2$.

Denote by $\mathcal{A}$ the set of all skew-hermitian matrices
$X = (x_{ij}) \in M_n(R)$ which are congruent to $A$ and such that $x_{1j} = 0$
for $j \neq 2$ while $x_{12}$ has degree $\nu_A$. For $X \in \mathcal{A}$ let
$d_X = \gcd(x_{12}, x_{21}, x_{22})$ where we require $d_X$ to be monic. Let
$\mathcal{A}_0$ denote the set of all $X \in \mathcal{A}$ such that $d_X$ has the
minimum degree. Without any loss of generality, we assume that
$A \in \mathcal{A}_0$.

Our first objective is to show that $d_A$ is 1 or $t$. Let $2 \leq r < s \leq n$
and for $x \in R$ define $A_x \in \mathcal{A}$ by
\[ A_x = (I_n + x E_{rs}) A (I_n + x^* E_{sr}) \]
and set $d_x = d_{A_x}$. For $\lambda \in F$, the $(r, r)$-th entry of
$A_{\lambda x}$ is
\[ a_{rr} + \lambda (a_{rs} x^* - a_{rs}^* x) + \lambda^2 a_{ss} x x^*. \tag{4.2} \]

We take first $r = 2$. As $a_{12}$ has only finitely many monic divisors, we can
choose distinct $\alpha, \beta, \gamma \in F$ such that
$d_{\alpha x} = d_{\beta x} = d_{\gamma x}$. Denote this common gcd by $d$. As the
Vandermonde determinant of $\alpha, \beta, \gamma$ is not 0, $d$ must divide
$a_{22}$, $a_{2s} x^* - a_{2s}^* x$ and $a_{ss} x x^*$. It follows that $d$
divides $d_A$, and consequently we must have $d = d_A$. By taking $x = 1$, we
infer that $d_A$ divides the diagonal entries of $A$. As $d_A$ divides
$a_{2s} x^* - a_{2s}^* x$ for all $x \in R$, we deduce that $d_A$ divides
$t a_{2s}$.

Next we take $r > 2$. Since $d_A$ must divide (4.2) for all $\lambda \in F$ and
$x \in R$, we infer that $d_A$ divides $a_{rs} x^* - a_{rs}^* x$ for all
$x \in R$. Consequently, $d_A$ divides $t a_{rs}$. As $\gcd(A) = 1$, it follows
that $d_A$ is either 1 or $t$.

We shall now rule out the possibility $d_A = t$. Suppose that $d_A = t$. Assume
that all entries $a_{2j}$ are divisible by $t$. In the above construction, we take
once again $r = 2$ and choose $s > 2$ such that $a_{sk}$ is not divisible by $t$
for some $k > 1$ and $k \neq s$. As above, we can choose a nonzero
$\lambda \in F$ such that $d_{A_\lambda} = d_A$. Then the $(2, k)$-th entry of
$A_\lambda$ is not divisible by $t$. Hence we can assume that one of the entries
in the second row, say $a_{23}$, is not divisible by $t$.

The $3 \times 3$ submatrix in the upper left hand corner of $A$ has the form
\[ \begin{pmatrix} 0 & a t^k & 0 \\ -a^* (-t)^k & b t & c \\ 0 & -c^* & d
   \end{pmatrix}, \]
where $a, b, c$ are not divisible by $t$, $k \geq 1$, $\gcd(a, a^*, b) = 1$, and
the degree of $a t^k$ is equal to $\nu_A$. Moreover, by using Proposition 3.1, we
may also assume that $a$ is pure. Hence we can choose $x \in R$ such that
$a x^* + a^* x = (-1)^k b$. Then the vector $v = x e_1 + t^{k-1} e_2$ is isotropic
and
\[ f_A(v, e_1) = a^* t^{2k-1}, \qquad f_A(v, e_3) = c (-t)^{k-1}. \]
Hence there exists $w \in R^n$ such that
\[ f_A(v, w) = t^{k-1} \gcd(a^*, c), \]
contradicting the fact that $a t^k$ has degree $\nu_A$. We conclude that
$d_A = 1$.



It is now easy to finish the proof. By Proposition 3.3, we may assume that the
$2 \times 2$ submatrix $B$ in the upper left hand corner of $A$ has the form (4.1)
with $f$ pure. From the definition of $\nu_A$ it follows that the entries
$a_{1j}$, $j > 2$, are divisible by $f$. By performing suitable elementary
congruence transformations, we may assume that all these entries are 0. A similar
argument can be used to make the entries $a_{2j} = 0$ for $j > 2$.

Thus we have $A = B \oplus D$ where $D = (d_{ij})$. Replace the zero in the
$(2, 2)$ position of $A$ by $-d_{ii}$. It follows from Proposition [?] that this
change can be achieved by a congruence transformation on the block $B$. Now add
the $(i + 2)$-nd row of $A$ to the second row and then the $(i + 2)$-nd column to
the second column. The entry in the $(2, 2)$ position will become 0 again. From
the definition of $\nu_A$ it follows that the $(2, j + 2)$-nd entry of this new
matrix must be divisible by $f^*$. As this entry is equal to $d_{ij}$, we conclude
that all entries of $D$ are divisible by $f^*$. As $D^* = -D$, they are also
divisible by $f$. As $f$ is pure, all entries of $D$ are divisible by $ff^*$.

□

We are now able to answer Kac's question. 

Theorem 4.3. If $A \in M_n(R)$ is hermitian or skew-hermitian, then $A$ is
congruent to the direct sum of $1 \times 1$ matrices and $2 \times 2$ matrices
with zero diagonal.

Proof. As observed in Section 2, we may assume that $\det(A) \neq 0$ and
$\gcd(A) = 1$. We already know that the theorem is true if $n \leq 2$. It remains
to use induction and apply Propositions 4.1 and 4.2. □



To prove our second main result, we need the following simple lemma. 

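Lemma 4.4. Let $A, B \in M_2(R)$ be skew-hermitian, $\gcd(A) = \gcd(B) = 1$,
$\det(A) = \det(B) \neq 0$. Then $A$ and $B$ are congruent.

Proof. By Proposition 3.3, we may assume that
\[ A = \begin{pmatrix} 0 & ab \\ -a^* b^* & 0 \end{pmatrix}, \qquad
   B = \begin{pmatrix} 0 & ab^* \\ -a^* b & 0 \end{pmatrix} \]
with $ab$ and $ab^*$ pure. There exist $x, y \in R_0$ such that
$bb^* x - aa^* y = 1$. If
\[ S = \begin{pmatrix} b^* x & ay \\ a^* & b \end{pmatrix} \]
then $S \in SL_2(R)$ and $S^* B S = A$. □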
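The identity $S^*BS = A$ in this proof can be spot-checked on one concrete instance. The sketch below is an illustration under assumed data, not part of the proof: it takes $a = 1 + t$ and $b = 2 + t$ over the rationals (so that $ab$ and $ab^*$ are pure), together with $x = y = 1/3$, for which $bb^*x - aa^*y = 1$. The helper names are hypothetical.

```python
from fractions import Fraction as Fr


def trim(p):
    # Drop trailing zero coefficients, keeping at least one entry.
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p


def add(p, q):
    n = max(len(p), len(q))
    p = list(p) + [Fr(0)] * (n - len(p))
    q = list(q) + [Fr(0)] * (n - len(q))
    return trim([u + v for u, v in zip(p, q)])


def mul(p, q):
    out = [Fr(0)] * (len(p) + len(q) - 1)
    for i, u in enumerate(p):
        for j, v in enumerate(q):
            out[i + j] += u * v
    return trim(out)


def neg(p):
    return [-c for c in p]


def star(p):
    # The involution t -> -t on coefficient lists.
    return [(-c if k % 2 else c) for k, c in enumerate(p)]


def mat_mul(M, N):
    # Product of 2x2 matrices with polynomial entries.
    return [[add(mul(M[i][0], N[0][j]), mul(M[i][1], N[1][j]))
             for j in range(2)] for i in range(2)]


def mat_star(M):
    # Transpose, then apply the involution entrywise.
    return [[star(M[j][i]) for j in range(2)] for i in range(2)]


zero = [Fr(0)]
a = [Fr(1), Fr(1)]        # a = 1 + t
b = [Fr(2), Fr(1)]        # b = 2 + t
x = [Fr(1, 3)]            # even; (4 - t^2)/3 - (1 - t^2)/3 = 1
y = [Fr(1, 3)]

A = [[zero, mul(a, b)], [neg(mul(star(a), star(b))), zero]]
B = [[zero, mul(a, star(b))], [neg(mul(star(a), b)), zero]]
S = [[mul(star(b), x), mul(a, y)], [star(a), b]]

det_S = add(mul(S[0][0], S[1][1]), neg(mul(S[0][1], S[1][0])))
congruent = mat_mul(mat_mul(mat_star(S), B), S)
print(det_S == [1], congruent == A)
```

Exact rational arithmetic via `fractions.Fraction` is used so that the comparison with $A$ is an equality of polynomials rather than a floating-point approximation.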
Recall that two matrices $A, A' \in M_n(R)$ are said to be equivalent if there
exist $S, T \in GL_n(R)$ such that $A' = SAT$. A necessary and sufficient
condition for $A$ and $A'$ to be equivalent is that they have the same invariant
factors.

Theorem 4.5. Let $A, A' \in M_n(R)$ be both hermitian or both skew-hermitian. If
$A$ and $A'$ are equivalent, then they are congruent.

Proof. We use induction on $n$. The case $n = 1$ is trivial. Let $n > 1$. Denote
the invariant factors of $A$ (and $A'$) by $f_1, \ldots, f_n$. If $f_n = 0$ then
we can use the induction hypothesis. Assume that $f_n \neq 0$. By dividing $A$
and $A'$ by $f_1$, we may assume that $f_1 = 1$.

Now if $A$ and $A'$ are hermitian (resp. skew-hermitian) then Proposition 4.1
(resp. Proposition 4.2 and Lemma 4.4) allows us to finish the proof by using the
induction hypothesis.

We shall give more details in the skew-hermitian case. By Proposition 4.2 we may
assume that $A = B \oplus D$, where $B$ and $D$ are as stated there. Similarly, we
may assume that $A' = B' \oplus D'$. Since $\det(B) = f_2 = \det(B')$, Lemma 4.4
implies that $B$ and $B'$ are congruent. Since $D$ and $D'$ have the same
invariant factors, they are congruent by the induction hypothesis. Hence $A$ and
$A'$ are congruent.

□

5. Canonical form under congruence 

In the next theorem we characterize the invariant factors of hermitian and
skew-hermitian matrices. Clearly these factors have to be homogeneous.

Theorem 5.1. Let $0 \leq r \leq n$. Let $f_1, \ldots, f_n$ be a sequence of
homogeneous elements in $R$ such that $f_1, \ldots, f_r$ are monic, each dividing
the next one, and $f_{r+1}, \ldots, f_n$ are zero. Then this sequence is the list
of invariant factors of a hermitian (resp. skew-hermitian) matrix
$A \in M_n(R)$ of rank $r$ if and only if the following two conditions hold:

(i) Any maximal subsequence $f_i, f_{i+1}, \ldots, f_j$ consisting of consecutive
nonzero odd (resp. even) elements has even length. We shall write such a
subsequence as
\[ g_i, h_i, g_{i+2}, h_{i+2}, \ldots, g_{j-1}, h_{j-1}. \]

(ii) For each $(g_k, h_k)$ as above, $h_k = g_k p_k p_k^*$ with $p_k$ pure.

Proof. We prove necessity by induction on $n$. The cases $r = 0$ and $n = 1$ are
trivial. Let $r \geq 1$ and $n \geq 2$. By replacing $A$ with $f_1^{-1} A$, we may
assume that $f_1 = 1$.

If $A$ is hermitian, then Proposition 4.1 shows that $A$ is congruent to
$(1) \oplus B$ and we can apply the induction hypothesis to $B$ to finish the
proof.

If $A$ is skew-hermitian, then $A$ is congruent to the matrix $B \oplus D$ as
stated in Proposition 4.2. In particular $f_2 = \det(B)$ is even and not divisible
by $t$. We can now finish the proof by applying the induction hypothesis to $D$.

Sufficiency can be read off from the next theorem. □

It is now easy to obtain the canonical forms for hermitian and skew-hermitian 
matrices under congruence. 

Theorem 5.2. Let $A \in M_n(R)$ and $A^* = \varepsilon A$, where
$\varepsilon = \pm 1$, let $r$ be the rank of $A$, and let $f_1, \ldots, f_n$ be
the invariant factors of $A$. Form the direct sum, $B$, of the following blocks:

(i) The $1 \times 1$ matrices $(f_i)$ for each $i$ such that
$f_i^* = \varepsilon f_i$.

(ii) The $2 \times 2$ matrices
\[ g_k \begin{pmatrix} 0 & p_k \\ -p_k^* & 0 \end{pmatrix} \]
for each pair $(g_k, h_k = g_k p_k p_k^*)$ constructed in the previous theorem.

Then $A$ is congruent to $B$. Moreover such $B$ is unique up to the ordering of
the diagonal blocks and the factorizations $h_k = g_k p_k p_k^*$.

Proof. The matrices $A$ and $B$ have the same invariant factors.

□

6. Comments on other fields 

We introduce four conditions on a field $F$ assuming only that the characteristic
is not 2.

(K) Kac's question has an affirmative answer for the field $F$.
(N) The norm map $R \to R_0$ sending $x \mapsto xx^*$ is onto.
(U) The quadratic form $x^2 - ty^2$ over $R$ is universal.
(I) No element of $R_0$ is irreducible in $R$.

Proposition 6.1. For a field $F$ of characteristic $\neq 2$, the above four
conditions are equivalent to each other.

Proof. (K) $\Rightarrow$ (N). Let $a \in F$, $a \neq 0$. As
\[ A = \begin{pmatrix} -at & a \\ -a & t \end{pmatrix} \]
is skew-hermitian but not diagonalizable, it must be congruent to
\[ \begin{pmatrix} 0 & x \\ -x^* & 0 \end{pmatrix} \]
for some $x \in R$. Hence $\det(A) = -a(t^2 - a)$ splits over $F$. We deduce that
$F$ is quadratically closed, i.e., it has no quadratic extensions.

It remains to show that if $a = 1 + t^2 b$, with $b \in R_0$, then $a = xx^*$ for
some $x \in R$. This follows by applying the above argument to
\[ \begin{pmatrix} tb & 1 \\ -1 & t \end{pmatrix}. \]

(N) $\Rightarrow$ (K). Our proofs are valid under this weaker hypothesis.

(N) $\Rightarrow$ (U). Let $\sigma : R \to R_0$ be the isomorphism of
$F$-algebras sending $t$ to $t^2$. For $b \in R$ we have $\sigma(b) = zz^*$ for
some $z \in R$. By writing $z = \sigma(x) + t\sigma(y)$, ($x, y \in R$), we obtain
$\sigma(b) = \sigma(x)^2 - t^2 \sigma(y)^2$, i.e., $b = x^2 - ty^2$.

(U) $\Rightarrow$ (N). For $a \in R_0$ we have $a = \sigma(b)$ with $b \in R$. As
$b = x^2 - ty^2$ for some $x, y \in R$, we have $a = zz^*$ with
$z = \sigma(x) + t\sigma(y)$.

The equivalence between (N) and (I) is obvious. □
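The identity behind the equivalence of (N) and (U), namely $zz^* = \sigma(x^2 - ty^2)$ for $z = \sigma(x) + t\sigma(y)$, can be checked on a small example. The following is only a sketch with arbitrarily chosen integer polynomials $x = 1 + t$ and $y = 2 + t$; the helper names are hypothetical.

```python
def trim(p):
    # Drop trailing zero coefficients, keeping at least one entry.
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p


def add(p, q):
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return trim([u + v for u, v in zip(p, q)])


def mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, u in enumerate(p):
        for j, v in enumerate(q):
            out[i + j] += u * v
    return trim(out)


def neg(p):
    return [-c for c in p]


def star(p):
    # The involution t -> -t on coefficient lists.
    return [(-c if k % 2 else c) for k, c in enumerate(p)]


def sigma(p):
    # The algebra map R -> R_0 sending t to t^2.
    out = [0] * (2 * len(p) - 1)
    for k, c in enumerate(p):
        out[2 * k] = c
    return out


t = [0, 1]
x = [1, 1]                                   # x = 1 + t
y = [2, 1]                                   # y = 2 + t

b = add(mul(x, x), neg(mul(t, mul(y, y))))   # b = x^2 - t*y^2
z = add(sigma(x), mul(t, sigma(y)))          # z = sigma(x) + t*sigma(y)

print(mul(z, star(z)) == sigma(b))
```

Since $\sigma(x)$ and $\sigma(y)$ are even, $z^* = \sigma(x) - t\sigma(y)$, so the product $zz^*$ is a difference of squares; the check above confirms this for the chosen $x$ and $y$.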

One can construct examples of fields $F$ satisfying the above conditions without
being algebraically closed. Start with a finite Galois extension $K/E$ whose
Galois group is not a 2-group. Let $\sigma$ be an $E$-automorphism of an algebraic
closure $\bar{K}$ of $K$ whose restriction to $K$ is nontrivial and has odd order.
Then one can take $F$ to be the quadratic closure of the fixed field
$\bar{K}^{\sigma}$. In particular the quadratic closure of the prime field
$\mathbb{F}_p$ ($p$ odd) is an example. On the other hand, it is easy to see that
the quadratic closure of the rationals does not satisfy the condition (U).

In general, a hermitian or skew-hermitian matrix $A \in M_n(R)$ need not be
congruent to the direct sum of any $1 \times 1$ or $2 \times 2$ matrices. For
instance, this is the case when $F$ is the real field and

A =